Phonetic Ambiguity : Approaches, Touchstones, Pitfalls and New Approaches

نویسنده

  • Patrick Juola
چکیده

Phonetic ambiguity and confusibility are bugbears for any form of bottom-up or data-driven approach to language processing. The question of when an input is “close enough” to a target word pervades the entire problem spaces of speech recognition, synthesis, language acquisition, speech compression, and language representation, but the variety of representations that have been applied are demonstrably inadequate to at least some aspects of the problem. This paper reviews this inadequacy by examining several touchstone models in phonetic ambiguity and relating them to the problems they were designed to solve. An good solution would be, among other things, efficient, accurate, precise, and universally applicable to representation of words, ideally usable as a “phonetic distance” metric for direct measurement of the “distance” between word or utterance pairs. None of the proposed models can provide a complete solution to the problem; in general, there is no algorithmic theory of phonetic distance. It is unclear whether this is a weakness of our representational technology or a more fundamental difficulty with the problem statement. In any case, these results show that the representations can be as crucial as the system architecture, and that as much or more creativity is required to properly represent language as to process it.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Touchstones, Pitfalls, and New Directions

Phonetic ambiguity and confusibility are bugbears for any form of bottom-up or data-driven approach to language processing. The question of when an input is “close enough” to a target word pervades the entire problem spaces of speech recognition, synthesis, language acquisition, speech compression, and language representation, but the variety of representations that have been applied are demons...

متن کامل

Analysis of Phonetic Matching Approaches for Indic Languages

Phonetic matching plays an important role in multilingual information retrieval, where data is manipulated in multiple languages. User needs information in their local language which may be different from the language where data has been maintained. In such an environment, we need a system which matches the strings phonetically irrespective of errors either exactly or approximately. There are m...

متن کامل

بررسی مقایسه‌ای نظام قدیم و جدید آموزش مدیران و کارکنان دولت با نگاه استراتژیک

While reviewing the history and importance of traning state executives and employees, this article surveys the old and new on- the- job training for the said groups. The study primarily reviews the objectives and plans of each approach. Then the pitfalls of the first approach are enumerated, as studied by the State Management and Planning Organization's research team. The two approaches are com...

متن کامل

Grey theory, VIKOR and TOPSIS Approaches

Abstract This author introduces the concept of Stepwise Strategy Approach (SSA) for dealing with a number of problems arises in the current age of technology. This new idea is combined with the knowledge of Grey Theory for adding flexibility to decision making process. Grey theory is useful for grasping the ambiguity exists in the utilized information and the fuzziness appears in the human judg...

متن کامل

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره cmp-lg/9608020  شماره 

صفحات  -

تاریخ انتشار 1996